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Abstract 

Motivated by weak convergence results in [Takahashi, A. and Yoshida, N., Monte Carlo simulation 
with asymptotic method, J. Japan Statist. Soc, Vol. 35 No. 2, 171-203, 2005.], we show strong 
convergence for an accelerated Euler-Maruyama scheme applied to perturbed stochastic differential 
equations. The Milstein scheme with the same acceleration is also discussed as an extended result. 
The theoretical results can be applied to analyzing the multi-level Monte Carlo method originally 
developed by M.B. Giles. Several numerical experiments for the SABR stochastic volatility model 
are presented in order to confirm the efficiency of the schemes. 

Keywords: Strong convergence; asymptotic method; multi-level Monte Carlo 
1 Introduction 

Wc investigate an asymptotic method that accelerates numerical schemes for perturbed random variables. 
The general concept is as follows. Suppose that F'^ is a random variable depending on a small parameter 
e. Let us consider an approximation F'^ for F'^ independently with respect to e. Then the bias F'^ — F'^ 
may be close to the bias F'^ ~ F^ , since e has a small effect on the value of F*^ — F^. Therefore, we expect 
that 

F" - F° + F° is a better approximation than F". (1.1) 

In particular, our interest is to study the above property when F"^ is a functional of a stochastic process 
and F'^ comes from time discretization for it. In many cases, F^ is a simpler model than F"^ and its exact 
distribution is well-known (e.g. Gaussian random variables or functionals of Gaussian processes). Even 
if the exact distribution of is unknown, it seems to be possible to provide a new scheme F*^ — F^ + F'~' 
with another more efficient scheme for F^ (see Section 13.11 as an example) . 
In general, we consider the following three error structures: 

• Strong error: 

E[\F' ~ {F' ~ F° + F°)\P]^/P. (1.2) 
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• Weak error: 

\E[F'] - {E[F'] - ^[i^O] + E[F'^])\. (1.3) 

• Monte Carlo bias estimator for -Jj J^jLii^'^'''' ~ F'^'-') + E[F'^] where {F'^'^)j be an i.i.d. sampling 
of F': 

M 

Notice that in the case of Monte Carlo bias ([13), the term -jj Ej=i(^""') - ^[^"] works as a control 
variates method. For applications in strong error (|1.2|) , we need an exact or accurate numerical simulation 
method for On the other hand, in the cases of weak error (|1.3p and Monte Carlo bias (|1.4p . we have 
to know the value of E[F^], and therefore we need a closed formula or an accurate numerical scheme for 
i^fF"] such as the fast Fourier transform in one dimension. 

When F'^ denotes (a functional of) a stochastic differential equation X^, F"^ corresponds to a certain 
time discretization scheme X^'*-"' (n: number of partition). Takahashi-Yoshida [15] derived the following 
results in weak error sense (|1.3p and Monte Carlo bias sense (|1.4p for the Euler-Maruyama scheme 

E[f{X^^)] ~ (i?[/(X^''"^)] - i?[/(X°'("))] + E[f{XU) = 

1 

J=l 

and hence the total error (the root-mean-squared error; RMSE) is equal to 

Vari/2(i?[/(Xf)] - -i^(/(X^^(")-^") - /(X°'(")'^")) - i?[/(X°)]) = 0(; 

i=i ' 
Here they assumed some appropriate conditions for / and the coefficients of X^. This is the case where 
F^ = f{X^) and F" = /(X^'^"^) in ([THj), and F' = /(X^'^"^) in ([13). In order to make the total error 
0(7) with weak and Monte Carlo bias, the standard Euler-Maruyama scheme with i.i.d. sampling requires 
the computational cost n ■ M = 0(7"^), and in contrast, the accelerated Euler-Maruyama scheme with 
i.i.d. sampling requires the cost 0(e'^7~^). That is, the asymptotic method (jl.ip for the Euler-Maruyama 
scheme is 0(e'^)-times faster than the standard method. Moreover, we can construct a sampling scheme 
whose computational cost turns out to be 0(e^~"'7~^(log7/e)^) for any S > and Lipschitz continuous 
function / via the multi- level Monte Carlo method (See Theorem 14. 4p . 

In this paper, we develop the error analysis for the Euler-Maruyama and Milstein schemes with the 
asymptotic method in strong sense ()1.2p . Under suitable conditions, we will show that for any p > 2, 



A/1/2 



Afi/2 



E 



sup \Xt ~ (X^(") - X°'(") + X°)rl = of 4) (1.5) 



-0<t<T 

with a = 1/2 (= 1) for the Euler-Maruyama (Milstein, resp.) scheme Xj'*'"'. Although strong convergence 
is usually very slow, the asymptotic method (|1.1[) helps to improve the speed of convergence. 

A simplest example of (|1.5p is for the case where the SDE becomes the ODE when e = 0, namely, 

dX^ = h{Xl)dt + ea{Xl)dBt. 

However, from the viewpoint of applications, we can also consider the (Oth-order) e-expansion around 
linear models like Black-Scholes (See an analytical expansion in Kunitomo-Takahashi [lOj and Takahashi- 
Yamada [14]). Indeed, we can treat a perturbed stochastic differential equations such as 

dXl = hlXldt + y^X^dBt, 
dbl = hb{hl)dt + eVb{bl)dBt, 
dal = K{al)dt + eV„{a^t)dBt. 
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Notice that becomes the Black-Scholcs model with time-dependent coefficients. Therefore there are 
many applications in the models of dynamic assets with stochastic volatility and/or stochastic interest 
rate. In particular, wc will discuss more general stochastic differential equations so-called local-stochastic 
volatility type models. 

This paper is organized as follows: Section 2 is devoted to state theoretical results for strong conver- 
gence ()1.5|) . In Section 3 wc discuss pathwise simulation of stochastic volatility models. In Section 4, wc 
introduce the multi-level Monte Carlo method and its acceleration by the asymptotic method. In Section 
5 some numerical experiments for the SABR stochastic volatility model are given. In Appendix we give 
some mathematical results including the proof of main claims in Section 2 and 4. 

2 Strong convergence results 

As seen in the previous intruduction, the asymptotic method p.ip for discretizing stochastic processes 
is very natural to speed up the discretization procedure. We state here the basic setting to discuss the 
approximation schemes. Let us consider a stochastic differential equation (SDE) of the form 

dXI = b{Xl,e)dt + (j{Xl,e)dBt, X^ = xq, (2.1) 

where b S C(R^ x [0, 1]; R^), cr e C(R^ x [0, 1]; R^ x R'^), and Bt is a d-dimensional standard Brownian 
motion on a probability space (fi, J^, P) with a filtration {J-'t)t>o satisfying usual conditions. Throughout 
the paper, we use the equidistant partition ti = ^T, < i < n. The Euler-Maruyama and Milstein 
schemes will be considered with some smoothness conditions for the coefficients of the SDE. 



2.1 The Euler-Maruyama scheme with asymptotic method 

Let Xj''"' be the Euler-Maruyama scheme for the SDE X^ (Maruyama [l^): For t e [ti,ti+i], 

^eXn) _ ^.,(n) ^ ^(^^^.(") ^ _ + a{Xlf^\e){Bt - Bu). (2.2) 



The implementation of (|2.2p is very simple. Indeed, practitioners only need to know how to simulate 
normal random variables. The error of the scheme has been analyzed deeply by many researchers (see 
e.g. [K], [S], [1]). Roughly speaking, the strong order of convergence is equal to 1/2. and the weak order 
is equal to 1. 

We now prepare the assumptions for X. 

{H^y. \b{x,e)\ + \a{x,e)\<C{l + \x\). 

{H2): \b{x, e) - b{y, e)| + \a{x, e) - <7{y, e)| < C\x - y\. 

(Hs): \b{x, e) - bix, 0)| + \aix, e) ~ a(x, 0)| < Ce(l + \x\). 

{H4): For every e, &(•, e), cr(-, e) e and \dbix, e) - db{y, 0)| + \d(7{x, e) - da{y, 0)| < C(e -t- |a; - y\). 

The above constant C is independent of {x,y,e) e R^ x R^ x [0, 1]. 
Let us define the accelerated Euler-Maruyama scheme as 

The property (jl.ip for strong convergence is formulated rigorously as follows. 

Theorem 2.1. Suppose that {Hi)-{H4) hold. Then for any p > 2, there exists a constant C ~ C{T, xo,p) 
such that 



E 



sup \xt-Y; 

0<t<T 



1/p e 

<c 



1/2- 
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In particular, if we consider the small volatility model dX^ = h{X^)dt + edBt and the ODE dX^ = 
b{X^)dt, then intuitively speaking, {X^—X^) — {X^—Xf) cancels out the error from the drift term (except 
the effect of e), and the error from small volatility e only remains. Hence the total error is proportional 
to e. 

Of course, more general situations can be considered, for example, if b and a depends on time t, then 
some smoothness assumptions with respect to (t, e) are needed in addition to {Hi)-{H4). We will not 
attempt to prove this, but basically the asymptotic method works as well. 



2.2 The Milstein scheme with asymptotic method 

We next discuss the Milstein scheme which has a higher order rate of convergence than the Euler- 
Maruyama scheme in strong sense. Just for notational convenience, we only consider the case d = 1. 
Of course, in general dimensional setting with commutative vector fields (cr^)i<j<rf, we can use the 
(acceraleted) Milstein scheme as well. 

Throughout this section, we assume the following smoothness. 

• For every e, cr(-,e) G C^. 

The Milstein scheme X^'*-"^ for the SDE X^ is defined by 

^e,(„) ,^ ^e,in) ^ ^(x^^, (") ^ ,) _ + a{Xtf''\ e)iBt - B^) + a a' {Xl^}-^\ e) j' £ dBM 

= Xlf-^ + b{Xl^}'^\e)(t - U) + a{Xl^}-\em - Bt,) + l^a' {X:f"\e){{B, - B,,)' - (t - U)) 

for t e [ti,t,+i]. 

We use the (stronger) assumptions for X . 

{H[): {Hi) & \a<j'ix,e)\ + |6a'(x,e)| + \<J^ a" {x , e)\ < 0(1 + \x\). 

{H',): {H2) & \aa'ix, e) - aa'{y, e)\ + |6a'(x, e) - ba'iy, e)| + \a^a"{x, e) - a^a"{y, e)\ < C\x - y\. 
{H'^y. {H3) & \aa'{x,e)-(Ta'{x,0)\ + \ba' {x , e) - ba' {x , 0)\ + \a^a" {x, e) - a^a" {x,0)\ < Ce(l + |a:|). 
(Hi): [H,) & |(aa')'(x,e)-(aa')'(2/,0)| <C(e + |x-j/|). 
Let us define the accelerated Milstein scheme as 

Then we can get the higher order convergence rate. 

Theorem 2.2. Suppose that [H[)-{H'^) hold. Then for any p > 2, there exists a constant C — C{T,xo,p) 
such that 



E 



sup |x,^-y/'^"^r 

0<t<T 



1/p e 
< C-. 
n 



3 Application to pathwise simulation of stochastic volatility models 

Our goal in this section is to construct a faster pathwise approximation for perturbed stochastic differential 
equations which appear in financial modeling of volatility. 
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Figure 1: A sample path of discretized SABR model when v is small. 



3.1 An accelerated scheme for SABR model 

In financial modeling, the SABR model plays a role to fit the implied volatility especially in short time. 
The model is given by the SDE (Hagan et al. [8]) 

dSt = ^tS^dBl 

dat = uatipdE] + \J\ - p^dB\). 

The volatility is not a mean-reversion process, hence this model does not suit for pricing long-dated 
options. If /? < 1, as far as the authors know, there is no exact pathwise simulation method for the above 
SDE. In weak sense, several accurate simulation methods via Bessel processes are known. 

To avoid that the volatility process at becomes negative in approximation procedures, we use a 
logarithmic transform for a^. 

dSt = \/ ao cxp{at)StdB} 

ddt = — ^dt + v{pdBl + - p'^dB^). 

Consider e = f. Since we do not know exact pathwise simulation methods for S^, we substitute the 
Milstein scheme for S^. Therefore, we can use an 0(-^ -f i)-scheme defined by 

:= St - + si 

When v is small enough, a typical sample path is like Figure [1] Here we use n = 16 for the standard 
Euler-Maruyama scheme (Standard E-M) and the above accelerated scheme (Accelerated). 
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We next turn to consider another formal approximation scheme. Formally, ~ x when /3 w 1 and 
especially x 1. Thus consider the scaling Lt := St/So, (3 ~ P{e). 



Since is a log-normal process, it is useful to compute the path 1 1— )• and E[f{X^)]. We will check 
the efficiency of Y through a numerical test later. 

3.2 General stochastic volatility models 

The following model is an extension of local-stochastic volatility models applicable to both short and long 
term contingent claims in financial markets. 



where Jt is a compound Poisson process, which is often used to adapt especially short-dated large volatility 
smile/skew. 

We remark that it is difficult to fit short-dated volatility smile/skew under the Heston model (/3 = 1, 
7 = 1/2, Jt = 0), and then v can take very large value. On the other hand, under general models with j3 
and Jt, the parameter v need not to be so large. 

Let {tj} be the random jump times associated to Jt and consider a new time partition {tk\ '■— 
{ti\ U {tj}. On the time interval we can regard the approximation problem for St as the one 

for a continuous SDE. In particular by taking e = i/ = 0, the model becomes the CEV model with 
time-dependent coefficients. For a technical reason, we should consider some carefuU treatments around 
zero of the function (•)'^ (See [31 El [11]). 

4 Application to multi-level Monte Carlo method 

The theoretical results we obtained in previous can be applied to the multi-level Monte Carlo method 
(MLMC in short). We propose an accelerated Monte Carlo sampling for Takahashi-Yoshida's weak 
convergence method. 

4.1 The basic methodology of MLMC 

We forget the parameter e for the time being, and denote by Xt the continuous SDE Xf defined by p.ip . 

Let us define P := /{Xt) and Pi := /(X^"''), and consider the time-step size T/ni — T/k^ for a fixed 
fc G N. Let L e N and the sampling of multi-level Monte Carlo is defined by 




even 



Yt^:=St-S,m-L't). 



dSt = ^lStdt + ^S^dBl + St-dJt 



dat ^ X{e ~ at)dt ^ uaJipdB] + ^Jl - p^dB^) 



L 




(4.1) 



where each Yi is independently distributed and is given by 
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with i.i.d. sampling Pq-''' or (P/ — -P/-i) 



1, . . . , Ni. The most important point is to use the same 



Brownian motion path {Bt)t>o for simulating Pi and P;_i, and so the concept of the multi-level Monte 
Carlo method concerns the strong (pathwise) convergence rate. 
Clearly we show that 

E[P] - E[Y] ^ E[P] - E[Pl], 

therefore the weak rate of convergence depends only on the last number L. Moreover, we obtain from 
the independence of (Yi), 

Var(y) = ^— Var(ll). 



and by definition Var(y;) 
can obtain 



Var(Pi — P;-i). Suppose some suitable conditions for / and Xt- Then one 



E[Pi-P] = 0{llni), 

Y&?'^{Pl - P,_i) < IIP; - PII2 + ||P,-l - PII2 

= 0{l/n]'^). 

The last estimate is the strong convergence result in L^, which is discussed in this paper. 

The total computational cost C is determined by the level L, the number of sampling {Ni)fLQ, and 
the number of partition n;(= fc') so that 



ni. 



1=0 



Suppose that the required RMSE is 0(7). Then by choosing Ni = 0(7^^L7ij~^), the total variance 
Var(y) is of 0(7^). Now if we set L = log(7~^)/ log(A:) + 0(1), then the total time discretization error 
E[Pl — P] = 0(7). Consequently C = 0(7^^(log7)^) for the required accuracy 0(7). 



4.2 Accelerated MLMC sampling (with smooth payoffs) 

From now on, we reconsider the sampling of the accelerated Euler-Maruyama scheme introduced by 
Takahashi and Yoshida from the standard Monte Carlo method 

i=i 

to the multi-level Monte Carlo method via 

pnew ._ _ + E[f{X^)]. 

Giles [5] assumed that / is Lipschitz continuous to analyze the variance of estimator. On the other 
hand, we need f G := {g £ C^(R^;R) | dig and dijg are bounded, 1 < i,j < N} in order to use 
the asymptotics with respect to e (Notice that \f{xi) — f{yi) + I{V2) ~ !{x2)\ ^ C\xx — yi + y2 — in 
general). Our analysis follows from the lemma below. 

Lemma 4.1. For f e Cl, 

\f{xi) - f{yi) + f{v2) - f{x2)\ < ^^^^(ki - X2I + |yi - y2\)W -yi\ + l|V/||oo|a:i - yi + 2/2 - 2;2|. 

Proof. This can be proved immediately by using the mean value theorem twice (See also Lemma lA.3p . □ 
Then we have the following variance estimate. 
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Proposition 4.2. Assume that {Hi)-{H4) hold. For f G C'^ , we have 
Proof. The variance of the difference P^^™ — Pt-T is estimated as 

Var^/^^pncw _ pncw-^ < - /(X^'^"')) - (/ (X^' ^ ' ) - ^ ) ) || 2 

Thus by Lemma l4.1i 

WfiX^) - + - /(X0)||2 

< C{{\\X^ - x°\u + \\x'/"^^ - \U)\\X^ - X^''"''||4 + \\x^ - x^''"') + - |i2). 

Hence using Lcmma rA.HIA.2l and Theorem I2.fl wc get the rate of convergence. □ 

For the use of the multi-level Monte Carlo method, we have obtained the results as follows. 

S[Pi"™ - P] = 0(e/nO, 
Var^/2(pnow _ pnew^ ^ 0{e/n/^). 

So the estimator for {Pl^™)i>o has an equivalent effect to the one for {Pi)i>o with the required error 
0(7/e). Consequently wc get the order of computational cost 0(e^7~^(log(7/e))~^). Both the asymptotic 
method and multi-level Monte Carlo method are very easily computable, so that practitioners will get 
large benefit only with small additional implementation cost. 

Remark 4.3. Clearly, we can also check the variance estimate for the accelerated Milstein scheme. Let 
pncw _ f{X°'''"'^) + E[f{X^)]. By a similar argument, we derive that under {H[)-{H'^) and 

Vari/2(pncw _ p^ricw) < CeuY^. (4.2) 

We have not obtained weak convergence results for the accelerated Milstein scheme yet. However, we 
guess that from the basic proof of Takahashi-Yoshida [TB] , it holds that 

E[f{X^^)] - (P[/(X^'("^)] - P[/(X°'("))] + = 0(1) (4.3) 

under some smoothness conditions for / and the coefficients of X^. Thus combining the results (|4.2p . 
(|4.3p and the discussion in Giles [HIE], we finally conclude that the total computational cost is 0(e^7^^). 

4.3 Lipschitz payoffs 

Let us consider the first component (X^)^^^ as an asset dynamics. Our interest is pricing an option 

/((X^)(i)) with Lipschitz payoffs /: R ^ R. Set P;"™ = /((X^''"'^)(i))-/((X°'^"' V'^)+£^[./((^t)*'^)]- 
Then we can obtain an upper bound estimate as follows. 

Theorem 4.4. Assume {Hi)-{H4) and / : R — > R is a Lipschitz continuous Junction whose weak 
derivative has bounded variation in R. In addition, suppose (X^)^^-* has a bounded density, and [X^]^^^ 
also has a bounded density uniformly with respect to n. Then we have for any small 6 > 0, 

Vari/2(P,"°™ - PjIT) < Ce(^-^)/2„-i/2_ 
Proof. See Appendix C. □ 

This theorem implies that the required computational cost turns out to be 0(e^^''7^^(log7/e)^), with 

L = log(e7-i)/log(/c) -hO(l) and Af; = 0(ei-S~^iV^)- 

Wc now summarize strong rate of convergence for P; — P;_i and p^'^™ — Pj'^J^ in Table [TJ 
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Payoff 


Standard E-M 


Accelerated E-M 


Cl 






Lipschitz 




0(e(i-*)/2„-i/2) 


Digital 







Table 1: Strong rate of convergence of Pi — Pi-i and p^^™ — p^ 



4.4 Localization for irregular payoffs 

The regularity of / seems to be essential for the accelerated MLMC method introduced in previous. 
For example, we will see through computational experiments that the acceleration with discontinuous 
functions / does not work so well. 

We now propose a localization technique for this problem. Let us define a decomposition 

/ — fs~^ fir 

where fs is a smooth (at least Lipschitz continuous) function with f ~ fs- Then we apply the accelerated 
MLMC to the smooth part fs and the standard MLMC to the irregular part fir- In other words, we 
consider the MLMC method for 

Pr'^ifs) fiX'r) - fs{X^T) + E[fs{X^T)]- 
The standard MLMC for discontinuous functions was studied in Avikainen [T] and Giles et al. [7]. 

5 Simulations 

5.1 Numerical experiments for SABR model 

Li this section, we want to study an estimator of L^-norm {^f X]^ii(^T'' ~ F^'*'"''"')^)^/^ for the SABR 
model. As a reference path, we use 

^e^,(»rof) ijjg^fjad oiX^. 

We set the parameters as follows. 

• 5o = 100, P = 0.9, ao = 0.16 x Sl'^^~^\ v = 0.1, p = -0.6, T = 1 

• riref = 2^4, n = 8, 16, 32, 64, 128, 256. 

Here we considered a scaling for ao (via St « SQ~^St). The number of simulation M for the test is 10^. 
The results are given in Figured The accelerated scheme is faster than the standard method in both 
cases of L^-error. 

We next study the case with saveral Let us compute the L^-error ratio for a random variable Z 
which is defined as 

— rV X % ■ 

s[|4"-'')-5(,"^|2]i/2 

We fix the other parameters in the previous. In Figure [31 we can check the efficiency of the asymptotic 
method (only) when h' is small enough. 

Finally we compare Y and Y with different /3. Figure |4] shows that the efficiency of Y is very close 
to that of y as /? sa 1. Therefore if /3 « 1, we can apply the analytical tractability of Y to pathwise 
simulation, computing expectations, or so on. 
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Figure 2: L^.en-or : -iT"¥]^/^ for the left and E[ma^o<i<n,^, "'^ - f/"^ |2]i/2 for the right. 

5.2 Numerical tests for MLMC 

To show that the accelerated method is more efficient than the standard method with MLMC, we take 
a numerical test for £;[P,"™ - P] = 0{e/ni), and Var^/2(P,"'=^ - = 0{e/n]^'^) under the SABR 

model with small parameter v. Let us consider payoff functions (European and digital options) 

/(.t) = max(0, X - 100) or f{x) = l{^„ioo>o} 

and the parameters 

• 5o = 100, ^ = 1, ao = 0.16, u = 0.1, p = -0.6, T = 1 
The level structure of MLMC is given by fc = 4, i.e., ri/ = 4'. As a localization for digital option, we use 

fs{x) = {ma.x{x ~ 100 + h,0) - ma,x{x - 100 - h, 0))/2h. 

Here we set h = 1.0. 

Figure [5] and [6] show the numerical results. We used the number of simulation AI = 10^ for the left, 
and M — 10^ for the right. The results basically imply that the accelerated method works better than 
the standard one as in preceding numerical experiments. Remarkably the accelerated method performs 
worse in the case of variance estimates for digital option, likely due to discontinuity of the payoff function. 
In contrast, the localized scheme (AcceleratedJoc) performs better than the others to some extent. We 
note that for general 1/2 < /3 < 1, the (semi-)analytical formula for CEV option pricing model can be 
used in order to compute E[f{Sj^)] (See [13]). 

6 Discussion: small extension 

By the accelerated method, the Euler-Maruyama scheme can be faster with a parameter e small enough. 
But of course, if e is not so small, the method does not work effectively (See Figure [3]). 
To improve the efficiency of the method, we consider a natural extended scheme 
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Figure 3: L^-crror ratio for Y with different ly. 
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Figure 4: L^-error ratio for Y (schemel) and Y (scheme2) with different 
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Figure 5: European option: (Left) A comparison of weak convergence between Pi and (Right) A 

comparison of standard deviation between Pi — Pi-i and p^"''* — Pi^™- 
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Figure 6: Digital option: (Left) A comparison of weak convergence between Pi and Pl^™. (Right) A 
comparison of standard deviation between Pi — Pi-i and P^"™ — P^™- 
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where pe = l + 0(e) to keep the rate of convergence. For example, the optimal with respect to L^-error 
is given by 

:= argmin^[(^^^ - + piP" - F"))^] 
p 

_ E[{F'' - F'){F° - F°)] 
- E[{FO - F0)2] 

However, since the value of F*^ is unknown, we can not estimate PeiL"^) directly. In general it does not 
seem easy to choose an appropriate and computable p^- This issue is left for future work. 



A Proof of Theorem [Hi] and [272] 

We use the following notations. 
• 77(3) := U if s e [U,U+i). 

We will apply the Burkholder-Davis-Gundy (BDG) inequality 

CpE[{Mf^/^] < E[ sup \Mtf] < CpE[{MfJ^] 

0<t<T 

to the proofs below: Here p > and Mt is a continuous local martingale. 

Using the BDG inequality and Gronwall inequality, we can show the following moment estimates. 
(See [S] for the proof in the case of L^-norm.) 

Lemma A.l. (i) Suppose that the assumptions {Hi)-{H2) hold. Then for any p > 2, we have 

sup E[ sup \X^\P] + sup E[ sup \X^\P] < 00, 
ee[o,i] o<t<T ce[o,i] o<t<r 

sup max E[ sup \X^ ~ XI\p] < C{T,xo,p)/n^^^. 

eg[0,l]0<i<n-l t.<t<t, + i 

sup E[ sup \X't~Xtn<C{T,xo,p)/n^/^. 

ee[0,l] 0<t<T 

(a) Suppose that the assumptions {H[)-{H2) hold. Then for any p > 2, we have 

sup E[ sup \Xt\P] < 00, 

eG[0,l] 0<t<T 

sup E[ sup \X^-X^\P] <CiT,xa,p)/n. 

ee[04] 0<t<T 

We now give an important lemma for the proof of the main theorems. 
Lemma A. 2. (i) Under {Hi)-{H3) , we have for any p > 2, 

i/p 



E 



\xf-x?\p 



0<t<T 



(A.1) 



max E 

0<i<n-l 



1/p 



- ti<t<ti + i 

(ii) Under (iJi)-(iJ3), for any p > 2, 



sup \X^,-{Xl-Xl+X^)\P <CiT,xo,p)- 



/2- 



E 



l/p 



sup \Xl~^\P <CiT,xo.p)e. 



0<t<T 



(A.2) 
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(Hi) Under {H[)-{H'^), for any p > 2, 



E 



sup \X^-X?\pY^' <C{T,xo,p)e. 

0<t<T -I 



Proof, (i): Wc first note that 

X| - X? = \\h{Xl e) - b{Xl 0))ds + f\<j{Xt,e) - a{Xl 0))dBs, 
and by the BDG inequahty for the stochastic integral term, 
E[sup \Xt - X'^IP] < cJeU f\b{Xt,e)-b{XlO)\dsY]+E\{ f\a{Xl,e) ~ a{X^ ,0)fdsY^']) . 

0<s<t ^ L J L J/ 

Using the conditions {Hi)-{H3) for the above, we have immediately 

G{t) E[ sup \Xl - X^\P] < CieP + C2 f G(s)ds. 

0<s<t Jo 

Here the constants Ci and C2 do not depend on e. Thus from the Gronwall inequality we obtain (jA.ip . 
We next consider the second result (jA.2[) . Since 



X^ - - + X") = / {b{XI, e) - biXl 0))ds + / {a{XI, e) - a(X°, 0))dS„ 



ri(t) 



□ 



the inequality (|A.2p follows from (i?2)-(-ff3) and (|A.1|) . 

The proofs for (ii) and (iii) arc straightforward as in (|A.1|) . 

The following lemma will be used such as the Lipschitz continuous property. 

Lemma A. 3. (i) Assume that {Hi)-{H4) hold. Then 

|5(a;i,e) - b{yi,e) + b{y2,0) - 6(2:2, 0)| < C((e + \xi - X2I + \yi - y2\)(xi - yi) + \xi - ?/i + 2/2 - X2\). 
\a{xi,e) ~ (T{yi,e) + (j{y2,0) - cr{x2,0)\ < C((e + \xi - X2 \ + \yi - y2\){xi ~ yi) + \xi ~ yi + y2 - X2\). 

(ii) Assume that {H[)-{H'^) hold. Then 

\aa'{xi,e) - aa'{yi,e) + aa'{y2,0) - crcr'(x2,0)| 
< C((e + \xi - X2\ + \yi - y2\){xi - yi) + \xi - yi + y2 - X2I). 

Proof. We only prove for b. By the mean value theorem, 

e) - 6(yi, e) + 6(2/2, 0) - 6(2:2, 0) = i^i - Vi) + Cy^^M - ^2) 
where y :— db{px+ (1 — p)y, e)dp = ^. Taking the difference again in the right hand side, we have 

Cx,,yA^i -yi) = iCuyi ~C2,y2)i^i - yi) + ^,^2(2^1 -yi)- 



Finally, using the assumption (^4) for (^J^ ~ y,_^), we obtain the result. 

Now wc shall prove the theorems. 
Proof of Theorem \2.1l Let us define 



□ 



Gi(t):=i?[sup |X:-y/'(")n. 

0<s<t 
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By using the Gronwall inequality, our goal becomes to show the following: 

GAt) < C^-^ + C2 Gi{s)ds 

where Ci and C2 depend only on T, xo,P- 
We now compute 

where 



Jo 



and 

e-(i) = A6(X|,e)-6(X^(,),e) + fe(XO(,),0)-fe(^°,0))ds 
Jo 

+ / (a(X:,e)-a(X;,),e)+a(X,';(,),0)-a(X°,0))di?.. 
Jo 

For e'^(t), we obtain from Lemma lA. 31 

e) - 6) + b{X^^^^^,0) b{Xl 0)1 < C((e + - + - X^^,, - X^^,,! 

and 

\a{Xt, e) - a(X^(,), e) + a(X«(,),0) - 0)| < C{{e + \XI - X^\ + IX^^^^^ - X^^,)\)\XI - 

Hence the integral term in e'^(i) is evaluated by 

i?[sup I r(6(X:,e)-6(X^(,),e) + 6(XO(,),0)-&(XO,0))dsn 

0<r<t JO 

< C3((eP + 2 sup \\Xt-XXp) sup H^l - Ij^^ + sup 

^ 0<s<t 0<s<t 0<s<t ^ 

By using the BDG inequality, the stochastic integral term in e'^{t) also has the same bound (except the 
size of constant C3). Consequently, we have by Lemma rA.lllA.2l 

i?[sup |e-(s)n<C4-^. 
o<s<t nP''^ 

Applying a similar calculus to e'^{t), we also get 

E[ sup \e^{sm <C,^ + C, f - - + XO(,))r]ds 

0<s<i '1'^' Jo 



This finish the proof of Theorem O □ 
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Proof of Theorem Similarly to the proof of Thcorcm[2JJ wc shall show that for 62 (t) := i?[supg<j,<( \xt. 



G2it)<C\^ + C2 f G2{s)ds. 
nP Jo 

Now we consider the decomposition 

i=l 

where 

e\t) = /*(6(X;,(,),e) - + 6(X,';(,),0) - h{X^^^^yQ))ds 

, e) - a(X^(,) , e) + a(X°(,) , 0) - , 0))dS, 

/ e) - aa' e) + aa'(l°(,), 0) - aa'(XO(,), 0))dB,di?„ 

and 



e\{t)= [ f {ba'{X'^,e)~ba'{X^,0))drdB„ 

Jo Jrj{s) 
ft rs 



f r (laV'(X;,6)-ia2a"(X0,0))drdi?„ 

Jo Jrt{s) ^ ^ 
t ps 

U vf- ^\ ^\ I „„iivO n\ „„ilvO 



em 

el{t)= I I (aa'(X,^e)-aa'(X^(,),e)+aa'(X;;(,,),0)-aa'(X;^0))dS,dB,. 

Jo J-q(s) 

By a similar manner as in the proof of Theorem I2.1[ we can also obtain 

E[ sup {\e\s)\P + \e\{s)n <C,^ + C,f G,{s)ds. 
o<s<t ^ nP Jo 

Indeed, compared with Theorem l2.11 the reason why we can get the rate n~P above is due to the additional 
integrals /^(j,) ■ dr or ■ dBr inside the error terms. □ 

B Proof of Theorem 14.41 

Throughout this section, we use the following notations without confusion: 

. ||/||Lip:=inf{A'>0: |/(.T)-/(y)| </C|a;-y|, foranx,yeR}. 
• II/IItv sup_^<^^<...<^_^<^ ^Jl^ 1/(2::^) - 

We say / has bounded variation in R if H/Htv < 00. 

The following lemma plays a crucial role in the proof of the theorem. 

Lemma B.l (Avikainen [1], Theorem 2.4). Let X and X be real valued random variables with X, X € 

!)• addition, suppose X has a hounded density. Then for any function f of hounded variation in 
R and q>\, there exists a constant C > depending on p, q, and the essential supremum for a density 
of X such that 

\\f{X) - f{X)\\, < C||/||tv||^ - X\\^. 
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By the next lemma, we can obtain an approximation sequence of the payoff /. 

Lemma B.2. Let f he a bounded Lipschitz continuous function whose weak derivative has bounded 
variation in R. Then there exists a sequence {fj)j>i C C^{IV) such that 

11/ - /jiloo ^ 0, as j ^ oo, 
ll/jlloo < ll/llLip for all J >1, 
ll/jllTV < II/'IItv for all j> I. 

Proof. The approximate sequence can be constructed by moUifier convolutions fh := (/ * 0^), that 
is, (jjh '■= Ji4>{ji) with the conditions (i) G C°°, (ii) supp((/)) C {|a;| < 1}, (iii) > 0, and (iv) 
Jj^ 4){x)dx = 1. □ 

Proof of Theorem \4-4\ Assume that / is a bounded function whose derivative has bounded variation. 
As seen in Proposition note that 

\\f{X^)~f{X^) + f{X^)-f(X^)h 

< II f\fipX^ + (1 - p)X^) f'ipX^T + (1 - . - (X|,)(i))||2 

+ II /II Lip II ATy — X^ + ~ A"'^||2- 

The final line is bounded by ||/||Lip x 0(e/n^/^), and thus we turn to focus on the estimate for the second 
line. The second line is bounded by 

II f\r{pX^ + (1 - p)X^) ~ f'ipX^ + (1 - p)X^))dp\\2p\\XT - X^hg 



^ I' wifip^T + (1 - p)^t) - fipx^T + (1 - p)xm2pdp 

for any p,q > 1 such that 1/p + 1/q — 1 . Now using Lemma IB. 11 we have 

Wifipx^ + (1 - p)x^) - fipx^ + (1 - p)x^))hpdp 



<Cp,r\\f'\\TY / \\{pX^ + {l-p)X^)-ipX^ + {l-p)X^)r//-'''dp 

Jo 

< Cp,r,T,xo\\fhv e^^^. 

To obtain the result, we choose small p > 1 and large r > 1 such that 2p{r+i) > 5 ~ 

Finally, for general /, consider fK '■— (/ A K) V (—A') for if > as a first approximation, and apply 
Lemma FB. 2 1 to fK- Then we obtain the desired result by taking the limit. □ 
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